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Sequence Range: 1 co 5864 



>BglII 
I 

10 | 20 30 40 SO 60 70 80 90 100 

GACGGATCGGGJVOXTCTCC CGATCC C CTATGGTCGACTCTCAGTACAATCTGCTCTGATGC CGCAT AGTTAAGCCAGT ATCTGCTCC CTGCTTGTGTGTT 
CTGCCTAGCCCTCTAGAGGGCTAGGGGATACCAGCTGAGAGTCATGTT 

>MfoI 

I 

1L0 120 130 140 150 160 1 170 180 190 200 

GGAQOTCOCTGAGTAGTGCGCGAGCAAAATTTAAGCTACAACAAOOCAAO^ 
CCTCCAGCGACTCATCACGCGCTCGTTTTAAATTCGATGTTGTTCCGTTCC^ 

>Sp«I 

I 

>CMV_promoter | 

I I 

210 220 230 240 250 260 270 280 290 300 

CTGCTTCGCGATGT ACGGGCCAGATATACGCGTTG C CAT AT A 

GACGAAGCGCTACATGCCCGGTCTATATGCGCAACTGTAACTAATAACTGATCAATAAT^ 



310 



320 



330 



340 



350 



360 



370 



380 



390 400 
ATCTTCCCATAGT 



>NdeI 

I 

410 420 430 440 450 460 470 480 | 490 500 

AACGCCAATAGG<jACTTTCCATTGACGT<rAATG 

TTGCGGTTATCCCTGAAAGGTAACTGCAGTTACCCACCTGATAAATGCCATT^ 

>SnaBI 
I 

510 520 530 540 550 560 570 580 590 600 

CCTATTGAOSTCAATGACGGTAAATGGCCCGCCTGGCACTATGCCCAGTAC^ 
GGATAACTGCAGTTACITCCATTTACCGGGCGGAC 

610 620 630 640 650 660 670 680 690 700 

tcgctactaccatggtgat<x:ggttttggcagtacatcaa 

agcgataatx3gtaccactacgccaaaaccgtcatctagttacccgcacctatcgccaaactgagtgcc 

710 720 730 740 750 760 770 780 790 800 

T<3GGA ^ " 1 " 1^T1 M I " 1\3 GCACCAAAAT^ 

ACCCiraJUVCAAAACCGTGGTTTTAGTTGC^ 

>SacI 
I 

>Ecll36IZ 
I I 

| | >T7_promoter 

I I I 
810 . | 820 830 840 850 860 | 870 880 890 900 

GTCTATATAAGCAGAGCTCTCTGGCTAACTAGJU3AACCCACTGCT 
CAGATATATTCGTCTCGAGAGACCGATTGATCTCTT GG GTGAGGAATGA^ 



>HindlII 
I 

> Ko z ak_s equenc e 
I I 
>HindIII_linker_[ Split] 



>PflMI 



>BamHI 



j| 910 920| 930 940 950 960 970 980 990 1000 

TAAGCTTGCO^CJ\CCATCGACTGG^ 

ATTCGAACGGCGGTGGTACCTQACCTGGACCTAGGACAAGGACCA 

MDWTWILPLVAAATRVHS> 
_SIGEHU > 

SKKPGGPGK S> 
WNVCHU > 

910 920 930 903 TO 1335 OP PCV/HB. SIGEHU-WNVCHU_0 980 990 1000> 

1 0 2 0 3 0_ 



_10_ 



5 TO 437 OP 1 . SIGEH-WNVCHU 70_ 

_1 TO 54 OF SIGEHU 40 50 



_80_ 



_90_ 



>ApaI 

I 

>Bspl20I 
I I 

1010 1020 1030 1040 1050 1060 1070 1080 1090 | 1100 

CGCGCCGTGAACATGCTGAAGCGCGGCAT<X:CCCG 
GCGCGGCACTTGTACGACTTCGCGCCGTACG^^ 

RAVHMLRRGMPRVLS LIGLKRAMLS L I DGKGP I> 
. : WNVCHU ■. ; 



_1010_ 
_110 



_1020_ 
_120 



__1030 903 TO 1335 OF PCV/HB . SIGEHU- WNVCHU_70_ 

_130 5 TO 437 OP 1 . SIGEH-WNVCHU 170 



— 1080_ 
_180 



_1090_ 
_190 



_1100> 
_200 > 



>SfiI 



1110 1120 1130 1140 1150 1160 1170 1180 1190 1200 

CGAAGCACGACCGGOACGACCGOAAGAAGGCGAAG^ 

RFVLALLAPPRPTAIAPTRAVLDRWRQVNKQTAM> 
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.WNVCHU_ 



-Ji 10 -l\ 20 1130_903 TO 133S OF PCV/HB . SIGEHU-WNVCHU 7Q n fln ,,,7 120a 

.210 220 230 5 TO 437 OP 1 . SIGEH-WNVCHU 27?_ZZ3eo!!Z^90__Z~lio 




1210 -1220 1230_903 TO 133S OP PCV/HB. SIGEHU-WNVCHU 70 i 2fl n non TTno 

- 310 320 330 5 TO 437 OP 1 . SIGEH-WNVCHU 37 0_ZZ380_ZZ39 0_~Ioo!! 

>BStEII 
i 

>PaeR7I j 

I I 

>NotI >XhoI | >BstBI 
Ell I 
>NotI_introduction_ [Split J >VS epitop* 

II II I ~ | 

1320 1330 || 1340| 1 1350 |1360| 1370 




_903 TO 1335 OF PCV/HB". SIGEHU. 
S TO 437 OF 1 . SIGEH-WNVCH U ' > 

>AgeI >PmeI 

I I 
>Polyhistidine_tag | >BGH_polyadenylation_signal 

11410 | 1420 1430 |1440 1450 | 1460 1470 1480 man 1 =nn 

GCGCATGGCCAGTAGTAGTGOTAGTGGTAACTCAAAT^^ 

1510 1520 1530 1540 1550 1560 1570 1580 ison 

>BbsI 

1610 1620 1630 1640 1650 1660 1670 1680 1690 nnn 

CCCCCACCCCACCCC^TCCTGTCGTTCCCCCTCCTAACCCTTCTGT^ 

1710 1720 1730 1740 1750 1760 1770 1780 1790 ifloo 

CCAGCTGGGGCTCTAGGGGGTATCCCCACGCGCCCTGTAGCG 
(3GTCGACCCCC 3 AGATCCCCC A TAGGG< 3 TGCGC<^ 

«KC CT AGCGCCC^^ 1890 
GCGGGATCGCGGGCGAGGAAAGCGAAAGAAGGGAAGGA^ 

>DraIII 

1910 1920 1930 1940 1950 l96o! 1970 1980 1900 onnn 

GCTAAATCACGAAATGCCGTGGAGCTGGGGTTTTTT^ 



2010 2020 2030 2040 2050 2060 



2070 2080 2090 2100 



G<^CCTCAGGTGCAAGAAATTATCACCTGAGAACA^ 

2110 2120 2130 2140 2150 2160 2170 2180 2190 „« ft 

CTAAAGCCGGATAACCAATTTTTTAC^ 

22 i*_ 2220 2230 2240 2250 2260 2270 2280 2290 2^oo 

^GAOOGGTCOGTCCGTCTTCATACXrrTTCGTA 

2310 2320 2330 2340 2350 2360 2370 2380 2390 , 4n n 

AAOTCCGCCCAXCCCGCCCCTAA)CTOCGCCCAiGTTCCGCCCAT*PCTCCGC CCCATC3G^TG 

C^ACGTAGAGTTAATCACT^^ 

>AvrII 

I 

>StuI 

2410 242 0 2430 2440 2450 2460 2470 2480 !! 2490 2son 

ACTAATTTTTTTTATTTATGCAGAGGCCGAGGCC^ 
TGATTAAAAAAAATAAATACGTCTCCGGW 

>SmaI 
I 

>x »*f >BsaBl 
II | 

12510 2520 2530 2540 2550 | 2560 2570 2580 2590 2600 

TTCGAGGC^CCTCGAACATATAGGTAAAAGCCTAGACTAGTTCTCTG 
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>£hel 
I 

>NarI 

I I 

>Kaal 

CGGCCGCTTCXSgTOGAGAGOCTATTCO^AT^ 

GCCGGCGAACCCACCTCTCCGATAAGCCGATACTGACCCGTGTTGTCTGTOAGCC^ 



2730 



2740 



>PstI 



2750 



2760 



>M3CI 



>TthlllI 
I 

2810 | 2820 2830 2840 28S0 2860 2870 5««o o aa « 

2910 2920 2930 2940 2950 



2790 



2960 



2980 



TTCATAGGTAGT ACCGACTACGTTACGCCGCCGACGTATGCGAACTAGGCCGATGGACGGGTJ 



TAGCGTAGCTCGCTCGTGC 



3010 



3020 



TACTCCX^TGGAAGCCrc^ 

ATGAGCCTACCTTCGGCCAGAACAGCTAtTPCCTACTAGACCTGCTTCTCGTAGTCCCCGAGCGCGG 

cccg^g<^gaggatc^ctcgtgacc^^ 

GGGCTGCCGCTCCTAGAGCAGCACTGGGTACCGCTACGGACGAACGGC^ 



TACCGGCGAAAAGACCTAAGTAGCTGACACCGGCCG 



>RsrII 



ACCCACACCGCCTGGCGATAGTCCTGTATCGCAACCGATGGGCACTATAACGACTTCTCGAACC 



3560 



3580 



3590 



>BsaI 
I 

3620 



lTGTTTATTTCGTTATCCTAGTGTTTAAAGTGTTTAT 
>Bstll07I 

TCAOTO^AACTCA^WAATTG^^ 3850 IM0 3870 3880 389 ° 

CCGCCATTATGCCAATAGOTOTCTTAOTCCCCTATTGCGTCCTTTCTTGTACACTCGTTTTCCGOTC^ 



4110 



4120 



4130 



4140 



4150 



4160 



4210 4220 4230 4240 



4180 



4250 



4260 



4330 



4340 



4360 
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4510 4520 4530 4540 4550 4560 4570 4580 4590 4600 

GTGCTACAGAGTTCTTGAAGTCXrra^ 
CACGATGTCTCAAGAACTTCACCACCGGATO 



4610 4620 
AGTTGGTAGCTCTTGAT 
TCAACCATCQAfiAACTA 



4630 



4640 4650 4660 4670 4680 4690 4700 

ATTACGCGCAGAAA AAAAG GATCTCAAGAA 
ACCATCGCCACCAAAAAAACAAACGTTCGTCGTCTAATGCGCGTCTTTTTTTC CT AGAGTTCTT 

4710 4720 4730 4740 4750 4760 4770 4780 4790 4800 

GATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAAC^ 
CTAGGAAACTAGAAAAGATGCCCCAGACTGCGAGTCACCTTGCTTTT^^ 

4810 4820 4830 4840 4850 4860 4870 4880 4890 4900 

TCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAA 
AGGAAAATTTAATTTTTACTTCAAAATTTAGTTAGATTTCATAT^ 

>AhdI 

4910 4920 4930 4940| 4950 4960 4970 4980 4990 5000 

AGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACT^ 



5010 5020 5030 5040 5050 5060 5070 5080 5090 5100 

ATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCC^ 
TATGGCGCTCTGGGTGCGAGTGGCCGAGGTCTAAATA^ 

5110 5120 5130 5140 5150 5160 5170 5180 5190 5200 

CCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGT 
GGT AGGTCA^TAATT AACAACGGCCCTTCGATCTCATTCATCAAGC 

5210 5220 5230 5240 5250 5260 5270 5280 5290 5300 

ACGCT^GTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAA 
TGCGAGCAGCAAACCATACCGAAGTAAOTCGAGGCCAAGGGTTGCTAGTTCCGCTCAATGTACT 

>PVUl 

53101 5320 5330 S340 5350 5360 5370 5380 5390 5400 

GGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTG^ 
CCAGGAGGCTAGCAACAGTCTTCATTCAACCGGCGTCACAATAGTGAGT^ 



>ScaI 

5410 S420| 5430 5440 5450 5460 5470 5480 5490 5500 

ACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTCT 

CGAAAAGACACTGACCACTCATGAGTTGGTTCACT 

5510 5520 5530 5540 5550 5560 5570 5580 5590 5600 

GCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACCTTC 
CGGTGTATCGTCTTGAAATTTTCACGAGTAGTAACCTTTTGC^ 

5640 5650 5660 5670 5680 5690 5700 

ATGC CGCAAAAAAGGGAAT AA 
CCGTTTTACGGCGTTTTTTCCCTTATT 



5610 5620 5630 

CCCACTCOTGCACCCAACTGATCTTCAGCA' 
GGGTGAGCACGTGGGTTGACTAGAAGTCGTAGAAAA' 



>SspZ 

I 

5710 5720 5730 5740 | 5750 5760 5770 5780 5790 5800 

GGGCGACACCGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTA 
CCCGCTGTGCCTTTACAACTTATGAQTATGAGAAGGAAAAAGTTATAATA^ 



5810 5820 5830 5840 58S0 5860 

TATTTAGAAAAATAAACAAATAGOGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTC 
ATAAATCTTTTTATTTGTTTATCCCCAAGGCGCGTGTAAAGOGGCT^^ 



r 

In 
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Sequence Rang*: 1 to 5864 



>Bglll 

I 

10 I 20 30 40 50 60 70 80 

GAC(XaTCGC<^QATCltXV^TCCCCTATGgrCGACTCTC^ t , 

CTGCCTAGCCCTCTAGAGGGCTAGGGGATACCAGCTGAGACTCATGl^ 

>MfeI 

110 120 130 140 150 1601 170 180 190 5nn 

CCTCCAGCGACTC\TCACGCGCTC<*ITTTAAAT^ 



230 



>SpeI 

I 
I 
I 

250 



>CMV_prcanocer 

I 



310 



320 330 340 350 360 370 iao ian 

TGGACTTCCCCGTTACATAACTTJWS^AAATO^^ 



410 



420 



430 



>NdeI 



>SnaBI 

510 520 530 540 550 560 570 5A0 *ai 

GGATAACTGCAGTTACTGCCATTTACCGGGC^ 

TCGCTATTACCATGGTCATOCGGTTTTGGCACTA 

AGCGATAATOGTACCACTACCCCAAAACCCTCATCyrAOTTACCCGCACCTAT^ 

TGGCAGTOTGTTTTCKX^CCAAA^ 
AC^CTCAAACAAAACTOTtX^W 

>SacI 

I 

>Ecll36lI 
I I 

! ! >T7_prorootor 

I I I 

810 | 820 830 840 850 860 I 870 flfln aon 

<nCTATATAAGCAGAGCTCTCTGOTA^ 
CAGATATATTCGTCTCGAGAGACCGAr^ 



>HindIII 

I 



>P£lMI 



>Kozak.sequenc« 

I I 
>Hindl 1 1.1 inker. [ Spl i t ] 

II I 

TAACCl^SocC^ 970 980 *>° 1000 

KOWTWZ LPLVAAATRVH S> 
— . SIGBY__ > 

SKKPGGPGKS> 

T 910 r? 20 _«0_903 TO 1335 OP A. PCV/HB. SIGgY-WHVCHU 0 9fl0 """"^ gn _,nnn > 

10 20 30 4_5 TO 437 OP SIGBY-WNVCHU* 70 B O Q Q Ton > 



_10_ 



.1 TO 54 OP SIGEY_ 



_40_ 



_50_ 




>ApaI 

I 

>Bspl20l 

I I 

1070 1080 1090 j 1100 

^TGCTGAGCCTGATCGACGGCAAGGGCCCCATAC 
, ATQ 

D G K G P Z> 



1010. 

.110 



_1020. 
120 



__1030_903 TO 1335 OP A. PCV/HB. SIGBY-WNVCHU 70 
.130 1_5 TO 437 OP SIGBY-WNVCHU* 170 



_180_ 



1100> 

.200 > 



>SfiI 

I 

1110 I"© 1130 1140 1150 1160 

<*WV/tt^T(*XXCTGCTGGCCr ^ ^ 

CGAAGCACGACCGGGACGACCGGAAGAAGGCGAAGTGGCGCTAA'^**'* " 

RPVLALLAPPRPTAI 
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4510 4520 4530 4540 4550 4560 4570 4580 4590 4600 

GTGCTACAGACTTCTTGAACTASGTGGCCTAACTACTCCT^ 
CACGATgTCTCJUUMU^CTTCACaWCCG^ 

4610 4620 4630 4640 4650 4660 4670 4680 4690 ^^Jt700 

4710 4720 4730 4740 4750 4760 4770 4780 4790 4800 

GATCCTTTG A ' ll.lVlMl.lA CCGGCTCTGACGCTCACTGGAACGAAAACTCAC(rrT 
CTAGGAAACTAGAAAAGATGCCCCAGACTGCGAgrCACLl'iW 

4810 4820 4830 4840 4850 4860 4870 4880 4890 4900 

TCCTTTTAAATTJUUkAATGAAGTTTTAAATCAATCTAA 
AGGAAAATTTAATTTTTACTTCAAAATTTACTT^ 

>AhdI 

I 

4910 4920 4930 4940 | 4950 4960 4970 4980 4990 5000 

AGC<^TCTGTCTAl'i*lSJG"fTGATCCATAGTT 

TCGCTAGACAGATAAAGCAAGTAGCTATCAACGGJUrrGAGGGGC^ 

5010 5020 5030 5040 5050 5060 5070 5080 5090 5100 

ATACCGCGAGACCCACGCTCACCCGCTCC^ 
TATGGCGCTCTGGCTGCGAGTGGCCGAGCTCTAAAT^ 

5110 5120 5130 5140 5150 5160 5170 5180 5190 5200 

CCATCCACTCTATrAATTGTTGCCGff5AAGCTAGACTAACT 

GOTAOGTCAGATAATTAACAACGGCCCTrCGATCTCATPCATCAAGC(X3TCAATTATCAA 

5220 5230 5240 5250 5260 5270 5280 5290 5300 

TATGGCTTCATTCAQ l 'l SX t ^ 'ri XJ S G AACGATCA 
TGCGAGCAGCAAACCATACCGAAGTAACTCGACCCCAAC^ 

>PvuI 

I 

53101 5320 5330 5340 5350 5360 5370 5380 5390 5400 

GGTCCTCCGATCXrrrqTCAGAAGTAAGTTGGCCtXA tf 
CCAGGAGGCTACauU^AGTCTTCATTCAACCGG^^ 

>Scat 

I 

5410 54201 5430 5440 5450 5460 5470 5480 5490 5500 

G CnH ' l^ n ^^ ACTCCTGACTACTCAACCAAgrC^ 
CGAAAAGACACTGACCACTCATGA WiUX^ TT CA CTAAG 

5510 5520 5530 5540 5550 5560 5570 5580 5590 5600 

GCCACATAGCACJU^CTTTAAAACTGCT CA TCATTGGAAAA ^ 

5610 5620 5630 5640 5650 5660 5670 5680 5690 5700 

CCCACTC<7K*yiCCCAAC7X^TCTT^^ 
GGGTGAGCACCTOGGTTGACTAGAAgrCCTAGAAAA 

>SspI 
I 

5710 5720 5730 5740 I 5750 5760 5770 5780 5790 5800 

QGQCGACACGGAAATGTTGAATACTCATACTCTTOCTTTTTCAAT^ 
CCCGCTffrQCCTTTACAACTTATGACTATGAflAAGCAAA 

5810 5820 5830 5840 5850 5860 

TATTTAGAAAAATAAACAAATAGCXSGTTCCGCQCACATTTCCCCGA^ 
ATAAA lVlUVl ' l A lU ' lVri ' l ' A TCCCCAAOG C GCGTCTAAACCGGCTTTTCA^ 
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J 



_210_ 



_220_ 



1130_903 TO 1335 OP A. PCV/HB. SIGBY-WNVCHU_70_ 

_230 2_5 TO 437 OP SIGBY-WNVCHU* 270 Z 



_280_ 



—1190, 
_290 



_1200> 
_300 > 



1310 122 0 1230 1240 1250 1260 1270 1280 1290 1300 

GAACCACCTCCTCIXDCTTCAAflAAGCAO C TGGCeACCCT^ 
C TTCGT OC A C Ca C TCQAA imO ' lWllljA CCCGT^^ 

XHLLSFKKBLGTLTSAIWRRSSKQKKRGGKTGI> 

WNVCHU* > 

_1230_903 TO 1335 OF A. FCV/HB. SIGHT -WNVCHU_70_ 



_330_ 



_3_5 TO 437 OP SIGBY-WNVCHU*. 



-370. 



1280, 

_380 



_1290_ 
_390 



1300> 

_400 > 



>BstBII 

I 

>PaeR7I | 
I I 
>MotI >XhoI I 

I I I 

>Noc I_in t roduc t i on_ [ Spl i t ] 
II I I 

1320 1330 || 1340) | 1350 



>BstBI 

I 

>V5_epitope 

I I 

113601 1370 




_903 TO 1335 OP A. PCV/HB.SIGE 
5 TO 437 OP SIGBY-WNVCHU*_ 



>BGH_polyadenylat ion_s ignal 
I 



1500 



>AgaI >Pmel 

I I 
> Po lyhi a t i dine_tag | 

I I I 

11410 | 1420 1430 11440 1450 I 1460 1470 1480 1490 

CGCGTACCGGTCATCATCACCATCACCATTGAGTTTAA^ 

GCGCATOGCCAGTAGTAGTGGTAGTGOTAAC7XAAATTTGGGCGACT 

1510 1520 1530 1540 1550 1560 1570 1580 1590 1600 

CGTGCCTTCCTTGACCCTGGAAGGTC^ 
GCACGGAAGGAACTGGGACCTTCCACGGTGAGGG^ 

>BbSl 

I 

1610 1620 1630 1640 1650 1660 1670 1680 1690 1700 

QGGGGTGGCgroGGGCAGGACAGCAAGGGCGAGGATTOGGAACACA^ 
CCCCCACXXX^CCCCGTCCTgrCW^^ 



1710 



1720 



GGTCGACCCCGAGATCCCCCAT. 



1730 1740 1750 

kTCCCCACGCGCCCTGTAGCGGCGCATTA, 



1760 1770 1780 1790 1800 

TACGCGCAGCGTGACCGCTACACTTGCCAG 
ATGCGCGTCGCACTGGCGATGTGAACGGTC 



1810 1 820 1830 1840 1850 1860 1870 1880 1890 1900 

CrcCCTACCGCC^^ 

>DraIII 

1910 1920 1930 1940 1950 1960 1 1970 1980 1990 2000 

CGATTTAGTGCTrTACGGCXCCra 

GCTAAATCACCAAATOCCGTGGAGCTGGGGTTTTTTGAACTAAT^ 

2010 2020 2030 2040 2050 2060 2070 2080 2090 2100 

CGTTGCACTCCAlUlUVm'AATACTOGX C ^ 
CCAACCTCACGTGCAAeaAACTATCSCCTGA^ 

2110 2120 2130 2140 2150 2160 2170 2180 2190 2200 

GAVI'llJj GCC T A ' ri l jAff T AAAAAAT GAGCTGATTTAAC 
CTAAAOCCQGA TA A C CAAaTTTrTACTCCACTAA ^^ 

2210 2220 2230 2240 2250 2260 2270 2280 2290 2300 

GCTCCCGAOQCAQQCMAACTATGCAAAGCATGCATCTCAATTA 
CGAGWnCCOTCCOTCTTCATACGTTTCGTACCT 

2 310 2320 2330 2340 2350 2360 2370 2380 2390 2400 

CCATGCATCTCAATTAOTCAQCAACCATA^ 
CGTACGTAGAGTTAATCAGTOSTTGCTATCAGCrc^ 



2410 
ACTAATTTTTT 



2420 
TATTTAT 



TGATTAAAAAAAATAAATA 



>AvrII 
I 

>StuI 
II 

2450 2460 2470 2480 | I 2490 2500 

TATTCCAGAAGTACrniAGGAGG C lU'l'lM*lt3 GAGGCCT AGG<rrTTTGCAAA 
ATAAGGTCTTCATCACTCCTCC<yVAAAAACCT^^ 



>Smal 
I 

>XmaI >BsaBI 
I I I 
^12510 25 20 2530 2540 2550 | 2560 2570 2580 2590 2600 

AAGCTCCCGOGAGCTTGTATATCCATTTTCGGATCTGATCAAGAGACAGG 
TTCOAQCqCCttTOAACATATAGGTAA^ 
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2610 2620 2630 2640 2650 2660 2670 

C(^CGCTTW»TOa*GXG«rrATTCG<^ 

GCCGGCGAACCCACCTCTIX^TAAGCXGATACTGJ^CCGTCTTCTC 



>Bh6l 

I 

>NarI 

II 

>KasI 

I I I 

2690 I I 12700 
GTCAGCGCAGGGGCGCCC 



CCAAG 



>PstI >MflCl 
I I 
2740 2750 2760 2770 2780 2790 

ATGAACTGCAGGACC^GGCACCGCGGCTATCCnXJGCTGGCCACG 

TACTTGACGTCCTGCTCCGTCOCOCCGATAOCACCGACCGGTGCTGCCCGCAAX3GAACGCCT 



2800 



>TthlllI 

I 

2810 | 2820 2830 2840 2850 2860 2870 2880 2890 2900 

< *CTGTGCTCGACCTTCTCACTGAAGC(XKAAGGGACT 
CX^ACGAGCTCX^ACACTCACTTCGC^ 



2910 



2920 



2930 



2940 



2950 



2960 



2970 2980 2990 3000 

■TTCGACCACCAAGCGAAACATCGCATCGAGCGAGCACG 



TTCATAGGTAGTACCCyurTACGTTACGCCGCCaACGTATGCGAA 

3010 3020 3030 3040 3050 3060 3070 3080 3090 3100 

TACTCGGATCGAAGCCCKyrcTTCTCGATCA^ 
ATGAGCCTACCTTCGGCCACAACAGCTAGTCCTACTAGAC^ 

3110 3120 3130 3140 3150 3160 3170 3180 3190 3200 

CCCaaaXX^A^TCTCGTCG"!^^ 
GGG<^nXXGCTCCTA<»GCAGCACTGGCTA^ 



>RsrII 



32101 



3220 



3260 



3270 



3280 



3290 



3300 



3230 3240 3250 

TAGCGTTGGCTA 

ACCCACACCGCCTGGCGATAGTCCTGTATCGCAACCGATGGGCACTATA^ 

3310 3320 3330 3340 3350 3360 3370 3380 3390 3400 

TATCCXXreCTCCCGATTCGCAGCGCAT^^ 
ATAGaWCGACGGCTAAGCtrrCGCGTAGCG(»A<»TAGC^ 



3440 



3450 



3460 



3410 3420 3430 

GCCCAACCTGCCATCACGAGATTTCGATTCCACCGCCGCCTTCTAl 
CGGCTTGGACGGTAGTGCTCTAAAGCTAAGGTGGCG^ 

3510 



3470 



3480 



3490 3500 
ATCATCCTCCAG 
TACTAGGAGGTC 

3520 3530 3540 3550 3560 3570 3580 3590 3600 

CGOGGG^TCTCATGCTCKSACTTCTrcGCCCACCCCAACTTCr 
CCQCCCCTAGACTACGACCTCAAGAAGCQGC7A3GGGTTGAXCA 



>B8tll07X 

I | 

3610 3620 3630 3640 3650 3660 3670 3680 3690 3700 

AA^ii^-i-i-x^JtCT^ 

TTCGTAAAAAAAOTGACGTAAGATCAACACCAAACAGGTTTGAOT 

_ 3710 3720 3730 3740 3750 3760 3770 3780 3790 3800 

ATCAlWlS^TACCTOrnXXnVIVIVAAATTCT 
TACTACCAOTATCGACAAACGACACACTTTAACAATAGGCGAG^ 

3810 3820 3830 3840 3850 3860 3870 3880 3890 3900 

TQAGTGAQCTAACTCACATTAATOGCGTT(XtX"KJ*CrGCC^ 
ACTCACTOGATTCAGTCTAATTAACGaUWCt^^ 



3980 



3990 



4000 



3910 3920 3930 3940 3950 3960 3970 

o/3*CA Qua^ r ri wjr^ ^ 

CCTCatX reC AA»CQCATAACCCOC(aGAA<^^ 

4010 4020 4030 4040 4050 4060 4070 4080 4090 4100 

CGCGCTAATA£GGTTAT^ 

TACACTCGTTTTCCGGTCGTTTTCCGGTCCTTGGCAT 



CCGCCATTATOCCAATACGTCTCTTAGTCCCCTAT 



4110 «120 4130 4140 4150 4160 4170 4180 4190 4200 

CTGGCu'-i-i-i-ivt^TAGGCTCCGCCC^^ 

GACCGCAAAAAGOTATCCGAGGCGGGGGGACTGCTCGTAGTGTTTTT 



4300 



421 ° 4220 4230 4240 42S0 4260 4270 4280 4290 

CGTTKXXXXnXJCAAGCTCCCTC^^ 
GCA ** GOOG< SACCTTCGAa3GAGC^^ 



4370 4380 

CCCCCCGTTL 



4390 



4310 4320 4330 4340 4350 4360 

TCAATGCTCACCCTGTAGGTATCTCAGTTCGGTC2TA 
AGTTACGACTGCGACATCCATAGACTCAACCCACAT 

4410 4430 4440 44 50 4460 4470 4480 4490 4500 

TCCGOTAACTATCGTCTTGAGTCCAACCCGCTAAGACACGACTT 
AGGCCATTGATACCAGAACTCAflGTOGGCCCATTCTGTOCltSAATAGCGGTGACCG^ 
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